Learning from examples with unspecified attribute values

نویسندگان

  • Sally A. Goldman
  • Stephen Kwek
  • Stephen D. Scott
چکیده

We introduce the UAV learning model in which some of the attributes in the examples are unspecified. In our model, an example x is classified positive (resp., negative) if all possible assignments for the unspecified attributes result in a positive (resp., negative) classification. Otherwise the classificatoin given to x is "?" (for unknown). Given an example x in which some attributes are unspecified, the oracle UAV-MQ responds with the classification of x. Given a hypothesis h, the oracle UAV-EQ returns an example x (that could have unspecified attributes) for which h(x) is incorrect. We show that any class learnable in the exact model using the MQ and EQ oracles is also learnable in the UAV model using the MQ and UAV-EQ oracles as long as the counterexamples provided by the UAV-EQ oracle have a logarithmic number of unspecified attributes. We also show that any class learnable in the exact model using the MQ and EQ oracles is also learnable in the UAV model using the UAV-MQ and UAV-EQ oracles as well as an oracle... Read complete abstract on page 2.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Comparison of Various Routines for Unknown Attribute Value Processing The Covering Paradigm

Simple inductive learning algorithms assume that all attribute values are available. The well-known Quinlan's paper [Qui89] discusses quite a few routines for processing of unknown attribute values in the TDIDT family and analyzes seven of them. This paper introduces five routines for processing of unknown attribute values that have been designed for the CN4 learning algorithm, a large extensio...

متن کامل

The Distribution Approximation Approach to Learning from Aggregated Data

This report describes four very simple methods for preparing training data sets for machine learning when training examples are split to several subsets, and only aggregated values of attributes are available for each subset. Specifically, each subset of examples is represented by a vector of pairs of values for each attribute. The first value in the pair is the mean, and the second is the stan...

متن کامل

Mining from incomplete quantitative data by fuzzy rough sets

Machine learning can extract desired knowledge from existing training examples and ease the development bottleneck in building expert systems. Most learning approaches derive rules from complete data sets. If some attribute values are unknown in a data set, it is called incomplete. Learning from incomplete data sets is usually more difficult than learning from complete data sets. In the past, t...

متن کامل

Designing a model of intuitionistic fuzzy VIKOR in multi-attribute group decision-making problems

Multiple attributes group decision making (MAGDM) is regarded as the process of determining the best feasible solution by a group of experts or decision makers according to the attributes that represent different effects. In assessing the performance of each alternative with respect to each attribute and the relative importance of the selected attributes, quantitative/qualitative evaluations ar...

متن کامل

Learning cross-level certain and possible rules by rough sets

Machine learning can extract desired knowledge and ease the development bottleneck in building expert systems. Among the proposed approaches, deriving rules from training examples is the most common. Given a set of examples, a learning program tries to induce rules that describe each class. Recently, the rough-set theory has been widely used in dealing with data classification problems. Most of...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Inf. Comput.

دوره 180  شماره 

صفحات  -

تاریخ انتشار 2003